Dataset statistics
| Number of variables | 27 |
|---|---|
| Number of observations | 46228 |
| Missing cells | 48847 |
| Missing cells (%) | 3.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 9.5 MiB |
| Average record size in memory | 216.0 B |
Variable types
| Numeric | 13 |
|---|---|
| Categorical | 14 |
country has a high cardinality: 150 distinct values | High cardinality |
agent has 5522 (11.9%) missing values | Missing |
company has 43323 (93.7%) missing values | Missing |
previous_cancellations is highly skewed (γ1 = 24.57744197) | Skewed |
df_index has unique values | Unique |
lead_time has 2836 (6.1%) zeros | Zeros |
previous_cancellations has 45857 (99.2%) zeros | Zeros |
previous_bookings_not_canceled has 44761 (96.8%) zeros | Zeros |
booking_changes has 37639 (81.4%) zeros | Zeros |
days_in_waiting_list has 45127 (97.6%) zeros | Zeros |
total_of_special_requests has 21617 (46.8%) zeros | Zeros |
Reproduction
| Analysis started | 2023-01-20 05:17:03.111416 |
|---|---|
| Analysis finished | 2023-01-20 05:18:05.909039 |
| Duration | 1 minute and 2.8 seconds |
| Software version | pandas-profiling v3.4.0 |
| Download configuration | config.json |
| Distinct | 46228 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 92264.34827 |
| Minimum | 40060 |
|---|---|
| Maximum | 119389 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 361.3 KiB |
Quantile statistics
| Minimum | 40060 |
|---|---|
| 5-th percentile | 43378.45 |
| Q1 | 84651.75 |
| median | 96254.5 |
| Q3 | 107822.25 |
| 95-th percentile | 117076.65 |
| Maximum | 119389 |
| Range | 79329 |
| Interquartile range (IQR) | 23170.5 |
Descriptive statistics
| Standard deviation | 21051.41383 |
|---|---|
| Coefficient of variation (CV) | 0.22816412 |
| Kurtosis | 0.5861398278 |
| Mean | 92264.34827 |
| Median Absolute Deviation (MAD) | 11585.5 |
| Skewness | -1.136907785 |
| Sum | 4265196292 |
| Variance | 443162024.2 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 40060 | 1 | < 0.1% |
| 103959 | 1 | < 0.1% |
| 103961 | 1 | < 0.1% |
| 103962 | 1 | < 0.1% |
| 103963 | 1 | < 0.1% |
| 103964 | 1 | < 0.1% |
| 103965 | 1 | < 0.1% |
| 103966 | 1 | < 0.1% |
| 103967 | 1 | < 0.1% |
| 103968 | 1 | < 0.1% |
| Other values (46218) | 46218 |
| Value | Count | Frequency (%) |
| 40060 | 1 | |
| 40066 | 1 | |
| 40070 | 1 | |
| 40071 | 1 | |
| 40072 | 1 | |
| 40073 | 1 | |
| 40075 | 1 | |
| 40077 | 1 | |
| 40078 | 1 | |
| 40082 | 1 |
| Value | Count | Frequency (%) |
| 119389 | 1 | |
| 119388 | 1 | |
| 119387 | 1 | |
| 119386 | 1 | |
| 119385 | 1 | |
| 119384 | 1 | |
| 119383 | 1 | |
| 119382 | 1 | |
| 119381 | 1 | |
| 119380 | 1 |
| Distinct | 384 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 80.70273427 |
| Minimum | 0 |
|---|---|
| Maximum | 518 |
| Zeros | 2836 |
| Zeros (%) | 6.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 361.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 12 |
| median | 50 |
| Q3 | 121 |
| 95-th percentile | 265 |
| Maximum | 518 |
| Range | 518 |
| Interquartile range (IQR) | 109 |
Descriptive statistics
| Standard deviation | 89.8630283 |
|---|---|
| Coefficient of variation (CV) | 1.113506613 |
| Kurtosis | 2.901544293 |
| Mean | 80.70273427 |
| Median Absolute Deviation (MAD) | 45 |
| Skewness | 1.640353121 |
| Sum | 3730726 |
| Variance | 8075.363856 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2836 | 6.1% |
| 1 | 1632 | 3.5% |
| 2 | 1003 | 2.2% |
| 4 | 920 | 2.0% |
| 3 | 903 | 2.0% |
| 5 | 811 | 1.8% |
| 6 | 743 | 1.6% |
| 7 | 641 | 1.4% |
| 8 | 586 | 1.3% |
| 12 | 530 | 1.1% |
| Other values (374) | 35623 |
| Value | Count | Frequency (%) |
| 0 | 2836 | |
| 1 | 1632 | |
| 2 | 1003 | 2.2% |
| 3 | 903 | 2.0% |
| 4 | 920 | 2.0% |
| 5 | 811 | 1.8% |
| 6 | 743 | 1.6% |
| 7 | 641 | 1.4% |
| 8 | 586 | 1.3% |
| 9 | 469 | 1.0% |
| Value | Count | Frequency (%) |
| 518 | 22 | |
| 504 | 21 | |
| 479 | 20 | |
| 478 | 2 | < 0.1% |
| 476 | 15 | |
| 468 | 13 | |
| 465 | 16 | |
| 464 | 4 | < 0.1% |
| 463 | 1 | < 0.1% |
| 462 | 21 |
arrival_date_year
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 361.3 KiB |
| 2016 | |
|---|---|
| 2017 | |
| 2015 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 184912 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2015 |
|---|---|
| 2nd row | 2015 |
| 3rd row | 2015 |
| 4th row | 2015 |
| 5th row | 2015 |
Common Values
| Value | Count | Frequency (%) |
| 2016 | 22733 | |
| 2017 | 15817 | |
| 2015 | 7678 | 16.6% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 2016 | 22733 | |
| 2017 | 15817 | |
| 2015 | 7678 | 16.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 46228 | |
| 0 | 46228 | |
| 1 | 46228 | |
| 6 | 22733 | |
| 7 | 15817 | 8.6% |
| 5 | 7678 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 184912 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 46228 | |
| 0 | 46228 | |
| 1 | 46228 | |
| 6 | 22733 | |
| 7 | 15817 | 8.6% |
| 5 | 7678 | 4.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 184912 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 46228 | |
| 0 | 46228 | |
| 1 | 46228 | |
| 6 | 22733 | |
| 7 | 15817 | 8.6% |
| 5 | 7678 | 4.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 184912 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 46228 | |
| 0 | 46228 | |
| 1 | 46228 | |
| 6 | 22733 | |
| 7 | 15817 | 8.6% |
| 5 | 7678 | 4.2% |
arrival_date_month
Real number (ℝ≥0)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.546054339 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 361.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 7 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.073563744 |
|---|---|
| Coefficient of variation (CV) | 0.4695292133 |
| Kurtosis | -0.9989738283 |
| Mean | 6.546054339 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.03911360686 |
| Sum | 302611 |
| Variance | 9.446794088 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 5381 | |
| 7 | 4782 | |
| 5 | 4579 | |
| 6 | 4366 | |
| 10 | 4337 | |
| 9 | 4290 | |
| 3 | 4072 | |
| 4 | 4015 | |
| 2 | 3064 | |
| 11 | 2696 | |
| Other values (2) | 4646 |
| Value | Count | Frequency (%) |
| 1 | 2254 | |
| 2 | 3064 | |
| 3 | 4072 | |
| 4 | 4015 | |
| 5 | 4579 | |
| 6 | 4366 | |
| 7 | 4782 | |
| 8 | 5381 | |
| 9 | 4290 | |
| 10 | 4337 |
| Value | Count | Frequency (%) |
| 12 | 2392 | |
| 11 | 2696 | |
| 10 | 4337 | |
| 9 | 4290 | |
| 8 | 5381 | |
| 7 | 4782 | |
| 6 | 4366 | |
| 5 | 4579 | |
| 4 | 4015 | |
| 3 | 4072 |
arrival_date_week_number
Real number (ℝ≥0)
| Distinct | 53 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.15953535 |
| Minimum | 1 |
|---|---|
| Maximum | 53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 361.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 16 |
| median | 27 |
| Q3 | 38 |
| 95-th percentile | 49 |
| Maximum | 53 |
| Range | 52 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 13.56208133 |
|---|---|
| Coefficient of variation (CV) | 0.4993487982 |
| Kurtosis | -0.9898022355 |
| Mean | 27.15953535 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | -0.02033523998 |
| Sum | 1255531 |
| Variance | 183.9300501 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 33 | 1305 | 2.8% |
| 34 | 1263 | 2.7% |
| 21 | 1159 | 2.5% |
| 32 | 1157 | 2.5% |
| 27 | 1121 | 2.4% |
| 30 | 1119 | 2.4% |
| 38 | 1101 | 2.4% |
| 39 | 1098 | 2.4% |
| 28 | 1078 | 2.3% |
| 41 | 1072 | 2.3% |
| Other values (43) | 34755 |
| Value | Count | Frequency (%) |
| 1 | 440 | |
| 2 | 448 | |
| 3 | 488 | |
| 4 | 527 | |
| 5 | 527 | |
| 6 | 589 | |
| 7 | 793 | |
| 8 | 865 | |
| 9 | 808 | |
| 10 | 899 |
| Value | Count | Frequency (%) |
| 53 | 692 | |
| 52 | 412 | |
| 51 | 330 | 0.7% |
| 50 | 540 | |
| 49 | 602 | |
| 48 | 713 | |
| 47 | 703 | |
| 46 | 595 | |
| 45 | 646 | |
| 44 | 979 |
arrival_date_day_of_month
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.81861642 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 361.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.729888951 |
|---|---|
| Coefficient of variation (CV) | 0.5518743686 |
| Kurtosis | -1.191748227 |
| Mean | 15.81861642 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -0.01571243558 |
| Sum | 731263 |
| Variance | 76.2109611 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 25 | 1736 | 3.8% |
| 6 | 1657 | 3.6% |
| 20 | 1653 | 3.6% |
| 2 | 1648 | 3.6% |
| 5 | 1634 | 3.5% |
| 19 | 1616 | 3.5% |
| 18 | 1603 | 3.5% |
| 17 | 1598 | 3.5% |
| 26 | 1587 | 3.4% |
| 24 | 1578 | 3.4% |
| Other values (21) | 29918 |
| Value | Count | Frequency (%) |
| 1 | 1280 | |
| 2 | 1648 | |
| 3 | 1420 | |
| 4 | 1470 | |
| 5 | 1634 | |
| 6 | 1657 | |
| 7 | 1373 | |
| 8 | 1388 | |
| 9 | 1543 | |
| 10 | 1505 |
| Value | Count | Frequency (%) |
| 31 | 817 | |
| 30 | 1265 | |
| 29 | 1427 | |
| 28 | 1532 | |
| 27 | 1495 | |
| 26 | 1587 | |
| 25 | 1736 | |
| 24 | 1578 | |
| 23 | 1537 | |
| 22 | 1346 |
adults
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 361.3 KiB |
| 2 | |
|---|---|
| 1 | |
| 3 | 2997 |
| 0 | 283 |
| 4 | 24 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 46228 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 32499 | |
| 1 | 10425 | 22.6% |
| 3 | 2997 | 6.5% |
| 0 | 283 | 0.6% |
| 4 | 24 | 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 2 | 32499 | |
| 1 | 10425 | 22.6% |
| 3 | 2997 | 6.5% |
| 0 | 283 | 0.6% |
| 4 | 24 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 32499 | |
| 1 | 10425 | 22.6% |
| 3 | 2997 | 6.5% |
| 0 | 283 | 0.6% |
| 4 | 24 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 46228 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 32499 | |
| 1 | 10425 | 22.6% |
| 3 | 2997 | 6.5% |
| 0 | 283 | 0.6% |
| 4 | 24 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 46228 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 32499 | |
| 1 | 10425 | 22.6% |
| 3 | 2997 | 6.5% |
| 0 | 283 | 0.6% |
| 4 | 24 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 46228 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 32499 | |
| 1 | 10425 | 22.6% |
| 3 | 2997 | 6.5% |
| 0 | 283 | 0.6% |
| 4 | 24 | 0.1% |
children
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 361.3 KiB |
| 0.0 | |
|---|---|
| 1.0 | 2014 |
| 2.0 | 1236 |
| 3.0 | 44 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 138684 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 42934 | |
| 1.0 | 2014 | 4.4% |
| 2.0 | 1236 | 2.7% |
| 3.0 | 44 | 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0.0 | 42934 | |
| 1.0 | 2014 | 4.4% |
| 2.0 | 1236 | 2.7% |
| 3.0 | 44 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 89162 | |
| . | 46228 | |
| 1 | 2014 | 1.5% |
| 2 | 1236 | 0.9% |
| 3 | 44 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 92456 | |
| Other Punctuation | 46228 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 89162 | |
| 1 | 2014 | 2.2% |
| 2 | 1236 | 1.3% |
| 3 | 44 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 46228 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 138684 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 89162 | |
| . | 46228 | |
| 1 | 2014 | 1.5% |
| 2 | 1236 | 0.9% |
| 3 | 44 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 138684 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 89162 | |
| . | 46228 | |
| 1 | 2014 | 1.5% |
| 2 | 1236 | 0.9% |
| 3 | 44 | < 0.1% |
babies
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 361.3 KiB |
| 0 | |
|---|---|
| 1 | 297 |
| 2 | 6 |
| 10 | 1 |
| 9 | 1 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.000021632 |
| Min length | 1 |
Characters and Unicode
| Total characters | 46229 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 45923 | |
| 1 | 297 | 0.6% |
| 2 | 6 | < 0.1% |
| 10 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 45923 | |
| 1 | 297 | 0.6% |
| 2 | 6 | < 0.1% |
| 10 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 45924 | |
| 1 | 298 | 0.6% |
| 2 | 6 | < 0.1% |
| 9 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 46229 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 45924 | |
| 1 | 298 | 0.6% |
| 2 | 6 | < 0.1% |
| 9 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 46229 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 45924 | |
| 1 | 298 | 0.6% |
| 2 | 6 | < 0.1% |
| 9 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 46229 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 45924 | |
| 1 | 298 | 0.6% |
| 2 | 6 | < 0.1% |
| 9 | 1 | < 0.1% |
meal
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 361.3 KiB |
| BB | |
|---|---|
| SC | |
| HB | |
| FB | 9 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 92456 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | HB |
|---|---|
| 2nd row | HB |
| 3rd row | HB |
| 4th row | HB |
| 5th row | HB |
Common Values
| Value | Count | Frequency (%) |
| BB | 35638 | |
| SC | 6601 | 14.3% |
| HB | 3980 | 8.6% |
| FB | 9 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| bb | 35638 | |
| sc | 6601 | 14.3% |
| hb | 3980 | 8.6% |
| fb | 9 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 75265 | |
| S | 6601 | 7.1% |
| C | 6601 | 7.1% |
| H | 3980 | 4.3% |
| F | 9 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 92456 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 75265 | |
| S | 6601 | 7.1% |
| C | 6601 | 7.1% |
| H | 3980 | 4.3% |
| F | 9 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 92456 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 75265 | |
| S | 6601 | 7.1% |
| C | 6601 | 7.1% |
| H | 3980 | 4.3% |
| F | 9 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 92456 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| B | 75265 | |
| S | 6601 | 7.1% |
| C | 6601 | 7.1% |
| H | 3980 | 4.3% |
| F | 9 | < 0.1% |
| Distinct | 150 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 361.3 KiB |
| PRT | |
|---|---|
| FRA | |
| DEU | |
| GBR | |
| ESP | |
| Other values (145) |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.9911089 |
| Min length | 2 |
Characters and Unicode
| Total characters | 138267 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 28 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | PRT |
|---|---|
| 2nd row | PRT |
| 3rd row | PRT |
| 4th row | PRT |
| 5th row | PRT |
Common Values
| Value | Count | Frequency (%) |
| PRT | 10879 | |
| FRA | 7081 | |
| DEU | 5012 | |
| GBR | 3753 | 8.1% |
| ESP | 3285 | 7.1% |
| ITA | 2054 | 4.4% |
| BEL | 1479 | 3.2% |
| NLD | 1259 | 2.7% |
| USA | 1189 | 2.6% |
| BRA | 1065 | 2.3% |
| Other values (140) | 9170 |
Length
| Value | Count | Frequency (%) |
| prt | 10879 | |
| fra | 7081 | |
| deu | 5012 | |
| gbr | 3753 | 8.1% |
| esp | 3285 | 7.1% |
| ita | 2054 | 4.4% |
| bel | 1479 | 3.2% |
| nld | 1259 | 2.7% |
| usa | 1189 | 2.6% |
| bra | 1065 | 2.3% |
| Other values (140) | 9170 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 25823 | |
| P | 14846 | |
| T | 14105 | |
| A | 13122 | |
| E | 11637 | |
| U | 8296 | 6.0% |
| F | 7366 | 5.3% |
| D | 6720 | 4.9% |
| B | 6543 | 4.7% |
| S | 6341 | 4.6% |
| Other values (16) | 23468 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 138267 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 25823 | |
| P | 14846 | |
| T | 14105 | |
| A | 13122 | |
| E | 11637 | |
| U | 8296 | 6.0% |
| F | 7366 | 5.3% |
| D | 6720 | 4.9% |
| B | 6543 | 4.7% |
| S | 6341 | 4.6% |
| Other values (16) | 23468 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 138267 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 25823 | |
| P | 14846 | |
| T | 14105 | |
| A | 13122 | |
| E | 11637 | |
| U | 8296 | 6.0% |
| F | 7366 | 5.3% |
| D | 6720 | 4.9% |
| B | 6543 | 4.7% |
| S | 6341 | 4.6% |
| Other values (16) | 23468 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 138267 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 25823 | |
| P | 14846 | |
| T | 14105 | |
| A | 13122 | |
| E | 11637 | |
| U | 8296 | 6.0% |
| F | 7366 | 5.3% |
| D | 6720 | 4.9% |
| B | 6543 | 4.7% |
| S | 6341 | 4.6% |
| Other values (16) | 23468 |
market_segment
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 361.3 KiB |
| Online TA | |
|---|---|
| Offline TA/TO | |
| Direct | |
| Groups | |
| Corporate | 2345 |
| Other values (2) | 663 |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 9.256467942 |
| Min length | 6 |
Characters and Unicode
| Total characters | 427908 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Offline TA/TO |
|---|---|
| 2nd row | Groups |
| 3rd row | Groups |
| 4th row | Groups |
| 5th row | Groups |
Common Values
| Value | Count | Frequency (%) |
| Online TA | 24257 | |
| Offline TA/TO | 9574 | 20.7% |
| Direct | 5037 | 10.9% |
| Groups | 4352 | 9.4% |
| Corporate | 2345 | 5.1% |
| Complementary | 478 | 1.0% |
| Aviation | 185 | 0.4% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| online | 24257 | |
| ta | 24257 | |
| offline | 9574 | 12.0% |
| ta/to | 9574 | 12.0% |
| direct | 5037 | 6.3% |
| groups | 4352 | 5.4% |
| corporate | 2345 | 2.9% |
| complementary | 478 | 0.6% |
| aviation | 185 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 58751 | |
| O | 43405 | |
| T | 43405 | |
| e | 42169 | |
| i | 39238 | |
| l | 34309 | |
| A | 34016 | |
| 33831 | ||
| f | 19148 | 4.5% |
| r | 14557 | 3.4% |
| Other values (14) | 65079 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 251465 | |
| Uppercase Letter | 133038 | |
| Space Separator | 33831 | 7.9% |
| Other Punctuation | 9574 | 2.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 58751 | |
| e | 42169 | |
| i | 39238 | |
| l | 34309 | |
| f | 19148 | 7.6% |
| r | 14557 | 5.8% |
| o | 9705 | 3.9% |
| t | 8045 | 3.2% |
| p | 7175 | 2.9% |
| c | 5037 | 2.0% |
| Other values (6) | 13331 | 5.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 43405 | |
| T | 43405 | |
| A | 34016 | |
| D | 5037 | 3.8% |
| G | 4352 | 3.3% |
| C | 2823 | 2.1% |
Space Separator
| Value | Count | Frequency (%) |
| 33831 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 9574 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 384503 | |
| Common | 43405 | 10.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 58751 | |
| O | 43405 | |
| T | 43405 | |
| e | 42169 | |
| i | 39238 | |
| l | 34309 | |
| A | 34016 | |
| f | 19148 | 5.0% |
| r | 14557 | 3.8% |
| o | 9705 | 2.5% |
| Other values (12) | 45800 |
Common
| Value | Count | Frequency (%) |
| 33831 | ||
| / | 9574 | 22.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 427908 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 58751 | |
| O | 43405 | |
| T | 43405 | |
| e | 42169 | |
| i | 39238 | |
| l | 34309 | |
| A | 34016 | |
| 33831 | ||
| f | 19148 | 4.5% |
| r | 14557 | 3.4% |
| Other values (14) | 65079 |
distribution_channel
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 361.3 KiB |
| TA/TO | |
|---|---|
| Direct | |
| Corporate | 2622 |
| GDS | 156 |
Length
| Max length | 9 |
|---|---|
| Median length | 5 |
| Mean length | 5.340140175 |
| Min length | 3 |
Characters and Unicode
| Total characters | 246864 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | TA/TO |
|---|---|
| 2nd row | TA/TO |
| 3rd row | TA/TO |
| 4th row | TA/TO |
| 5th row | TA/TO |
Common Values
| Value | Count | Frequency (%) |
| TA/TO | 37902 | |
| Direct | 5548 | 12.0% |
| Corporate | 2622 | 5.7% |
| GDS | 156 | 0.3% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| ta/to | 37902 | |
| direct | 5548 | 12.0% |
| corporate | 2622 | 5.7% |
| gds | 156 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 75804 | |
| A | 37902 | |
| / | 37902 | |
| O | 37902 | |
| r | 10792 | 4.4% |
| e | 8170 | 3.3% |
| t | 8170 | 3.3% |
| D | 5704 | 2.3% |
| i | 5548 | 2.2% |
| c | 5548 | 2.2% |
| Other values (6) | 13422 | 5.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 160246 | |
| Lowercase Letter | 48716 | 19.7% |
| Other Punctuation | 37902 | 15.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 10792 | |
| e | 8170 | |
| t | 8170 | |
| i | 5548 | |
| c | 5548 | |
| o | 5244 | |
| p | 2622 | 5.4% |
| a | 2622 | 5.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 75804 | |
| A | 37902 | |
| O | 37902 | |
| D | 5704 | 3.6% |
| C | 2622 | 1.6% |
| G | 156 | 0.1% |
| S | 156 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 37902 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 208962 | |
| Common | 37902 | 15.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 75804 | |
| A | 37902 | |
| O | 37902 | |
| r | 10792 | 5.2% |
| e | 8170 | 3.9% |
| t | 8170 | 3.9% |
| D | 5704 | 2.7% |
| i | 5548 | 2.7% |
| c | 5548 | 2.7% |
| o | 5244 | 2.5% |
| Other values (5) | 8178 | 3.9% |
Common
| Value | Count | Frequency (%) |
| / | 37902 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 246864 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| T | 75804 | |
| A | 37902 | |
| / | 37902 | |
| O | 37902 | |
| r | 10792 | 4.4% |
| e | 8170 | 3.3% |
| t | 8170 | 3.3% |
| D | 5704 | 2.3% |
| i | 5548 | 2.2% |
| c | 5548 | 2.2% |
| Other values (6) | 13422 | 5.4% |
is_repeated_guest
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 361.3 KiB |
| 0 | |
|---|---|
| 1 | 1591 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 46228 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 44637 | |
| 1 | 1591 | 3.4% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 44637 | |
| 1 | 1591 | 3.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 44637 | |
| 1 | 1591 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 46228 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 44637 | |
| 1 | 1591 | 3.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 46228 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 44637 | |
| 1 | 1591 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 46228 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 44637 | |
| 1 | 1591 | 3.4% |
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.02115600934 |
| Minimum | 0 |
|---|---|
| Maximum | 13 |
| Zeros | 45857 |
| Zeros (%) | 99.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 361.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 13 |
| Range | 13 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.336915105 |
|---|---|
| Coefficient of variation (CV) | 15.92526736 |
| Kurtosis | 718.655586 |
| Mean | 0.02115600934 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 24.57744197 |
| Sum | 978 |
| Variance | 0.113511788 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 45857 | |
| 1 | 191 | 0.4% |
| 2 | 59 | 0.1% |
| 3 | 44 | 0.1% |
| 11 | 25 | 0.1% |
| 4 | 21 | < 0.1% |
| 5 | 15 | < 0.1% |
| 6 | 15 | < 0.1% |
| 13 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 45857 | |
| 1 | 191 | 0.4% |
| 2 | 59 | 0.1% |
| 3 | 44 | 0.1% |
| 4 | 21 | < 0.1% |
| 5 | 15 | < 0.1% |
| 6 | 15 | < 0.1% |
| 11 | 25 | 0.1% |
| 13 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 13 | 1 | < 0.1% |
| 11 | 25 | 0.1% |
| 6 | 15 | < 0.1% |
| 5 | 15 | < 0.1% |
| 4 | 21 | < 0.1% |
| 3 | 44 | 0.1% |
| 2 | 59 | 0.1% |
| 1 | 191 | 0.4% |
| 0 | 45857 |
| Distinct | 73 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2085099939 |
| Minimum | 0 |
|---|---|
| Maximum | 72 |
| Zeros | 44761 |
| Zeros (%) | 96.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 361.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 72 |
| Range | 72 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.123927656 |
|---|---|
| Coefficient of variation (CV) | 10.18621513 |
| Kurtosis | 448.4554928 |
| Mean | 0.2085099939 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 18.62445817 |
| Sum | 9639 |
| Variance | 4.511068686 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 44761 | |
| 1 | 524 | 1.1% |
| 2 | 180 | 0.4% |
| 3 | 121 | 0.3% |
| 4 | 93 | 0.2% |
| 5 | 82 | 0.2% |
| 6 | 56 | 0.1% |
| 7 | 47 | 0.1% |
| 8 | 36 | 0.1% |
| 9 | 36 | 0.1% |
| Other values (63) | 292 | 0.6% |
| Value | Count | Frequency (%) |
| 0 | 44761 | |
| 1 | 524 | 1.1% |
| 2 | 180 | 0.4% |
| 3 | 121 | 0.3% |
| 4 | 93 | 0.2% |
| 5 | 82 | 0.2% |
| 6 | 56 | 0.1% |
| 7 | 47 | 0.1% |
| 8 | 36 | 0.1% |
| 9 | 36 | 0.1% |
| Value | Count | Frequency (%) |
| 72 | 1 | |
| 71 | 1 | |
| 70 | 1 | |
| 69 | 1 | |
| 68 | 1 | |
| 67 | 1 | |
| 66 | 1 | |
| 65 | 1 | |
| 64 | 1 | |
| 63 | 1 |
reserved_room_type
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 361.3 KiB |
| A | |
|---|---|
| D | |
| F | 1091 |
| E | 1048 |
| B | 747 |
| Other values (2) | 374 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 46228 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | A |
|---|---|
| 2nd row | A |
| 3rd row | A |
| 4th row | A |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| A | 35347 | |
| D | 7621 | 16.5% |
| F | 1091 | 2.4% |
| E | 1048 | 2.3% |
| B | 747 | 1.6% |
| G | 365 | 0.8% |
| C | 9 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| a | 35347 | |
| d | 7621 | 16.5% |
| f | 1091 | 2.4% |
| e | 1048 | 2.3% |
| b | 747 | 1.6% |
| g | 365 | 0.8% |
| c | 9 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 35347 | |
| D | 7621 | 16.5% |
| F | 1091 | 2.4% |
| E | 1048 | 2.3% |
| B | 747 | 1.6% |
| G | 365 | 0.8% |
| C | 9 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 46228 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 35347 | |
| D | 7621 | 16.5% |
| F | 1091 | 2.4% |
| E | 1048 | 2.3% |
| B | 747 | 1.6% |
| G | 365 | 0.8% |
| C | 9 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 46228 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 35347 | |
| D | 7621 | 16.5% |
| F | 1091 | 2.4% |
| E | 1048 | 2.3% |
| B | 747 | 1.6% |
| G | 365 | 0.8% |
| C | 9 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 46228 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 35347 | |
| D | 7621 | 16.5% |
| F | 1091 | 2.4% |
| E | 1048 | 2.3% |
| B | 747 | 1.6% |
| G | 365 | 0.8% |
| C | 9 | < 0.1% |
assigned_room_type
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 361.3 KiB |
| A | |
|---|---|
| D | |
| E | 1628 |
| B | 1501 |
| F | 1299 |
| Other values (3) | 984 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 46228 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | A |
|---|---|
| 2nd row | A |
| 3rd row | A |
| 4th row | A |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| A | 30106 | |
| D | 10710 | 23.2% |
| E | 1628 | 3.5% |
| B | 1501 | 3.2% |
| F | 1299 | 2.8% |
| G | 571 | 1.2% |
| K | 267 | 0.6% |
| C | 146 | 0.3% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| a | 30106 | |
| d | 10710 | 23.2% |
| e | 1628 | 3.5% |
| b | 1501 | 3.2% |
| f | 1299 | 2.8% |
| g | 571 | 1.2% |
| k | 267 | 0.6% |
| c | 146 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 30106 | |
| D | 10710 | 23.2% |
| E | 1628 | 3.5% |
| B | 1501 | 3.2% |
| F | 1299 | 2.8% |
| G | 571 | 1.2% |
| K | 267 | 0.6% |
| C | 146 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 46228 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 30106 | |
| D | 10710 | 23.2% |
| E | 1628 | 3.5% |
| B | 1501 | 3.2% |
| F | 1299 | 2.8% |
| G | 571 | 1.2% |
| K | 267 | 0.6% |
| C | 146 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 46228 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 30106 | |
| D | 10710 | 23.2% |
| E | 1628 | 3.5% |
| B | 1501 | 3.2% |
| F | 1299 | 2.8% |
| G | 571 | 1.2% |
| K | 267 | 0.6% |
| C | 146 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 46228 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 30106 | |
| D | 10710 | 23.2% |
| E | 1628 | 3.5% |
| B | 1501 | 3.2% |
| F | 1299 | 2.8% |
| G | 571 | 1.2% |
| K | 267 | 0.6% |
| C | 146 | 0.3% |
| Distinct | 21 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2643635892 |
| Minimum | 0 |
|---|---|
| Maximum | 21 |
| Zeros | 37639 |
| Zeros (%) | 81.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 361.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 21 |
| Range | 21 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.7097132655 |
|---|---|
| Coefficient of variation (CV) | 2.684610493 |
| Kurtosis | 99.22564145 |
| Mean | 0.2643635892 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.575270874 |
| Sum | 12221 |
| Variance | 0.5036929192 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 37639 | |
| 1 | 6238 | 13.5% |
| 2 | 1713 | 3.7% |
| 3 | 380 | 0.8% |
| 4 | 154 | 0.3% |
| 5 | 34 | 0.1% |
| 6 | 21 | < 0.1% |
| 7 | 18 | < 0.1% |
| 8 | 7 | < 0.1% |
| 14 | 4 | < 0.1% |
| Other values (11) | 20 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 37639 | |
| 1 | 6238 | 13.5% |
| 2 | 1713 | 3.7% |
| 3 | 380 | 0.8% |
| 4 | 154 | 0.3% |
| 5 | 34 | 0.1% |
| 6 | 21 | < 0.1% |
| 7 | 18 | < 0.1% |
| 8 | 7 | < 0.1% |
| 9 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 15 | 3 | |
| 14 | 4 | |
| 13 | 3 | |
| 12 | 1 | < 0.1% |
| 11 | 2 |
deposit_type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 361.3 KiB |
| No Deposit | |
|---|---|
| Non Refund | 24 |
| Refundable | 6 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 462280 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No Deposit |
|---|---|
| 2nd row | No Deposit |
| 3rd row | No Deposit |
| 4th row | No Deposit |
| 5th row | No Deposit |
Common Values
| Value | Count | Frequency (%) |
| No Deposit | 46198 | |
| Non Refund | 24 | 0.1% |
| Refundable | 6 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| no | 46198 | |
| deposit | 46198 | |
| non | 24 | < 0.1% |
| refund | 24 | < 0.1% |
| refundable | 6 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 92420 | |
| e | 46234 | |
| N | 46222 | |
| 46222 | ||
| s | 46198 | |
| i | 46198 | |
| t | 46198 | |
| p | 46198 | |
| D | 46198 | |
| n | 54 | < 0.1% |
| Other values (7) | 138 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 323608 | |
| Uppercase Letter | 92450 | 20.0% |
| Space Separator | 46222 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 92420 | |
| e | 46234 | |
| s | 46198 | |
| i | 46198 | |
| t | 46198 | |
| p | 46198 | |
| n | 54 | < 0.1% |
| f | 30 | < 0.1% |
| u | 30 | < 0.1% |
| d | 30 | < 0.1% |
| Other values (3) | 18 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 46222 | |
| D | 46198 | |
| R | 30 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 46222 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 416058 | |
| Common | 46222 | 10.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 92420 | |
| e | 46234 | |
| N | 46222 | |
| s | 46198 | |
| i | 46198 | |
| t | 46198 | |
| p | 46198 | |
| D | 46198 | |
| n | 54 | < 0.1% |
| R | 30 | < 0.1% |
| Other values (6) | 108 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 46222 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 462280 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 92420 | |
| e | 46234 | |
| N | 46222 | |
| 46222 | ||
| s | 46198 | |
| i | 46198 | |
| t | 46198 | |
| p | 46198 | |
| D | 46198 | |
| n | 54 | < 0.1% |
| Other values (7) | 138 | < 0.1% |
| Distinct | 202 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 5522 |
| Missing (%) | 11.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.08151133 |
| Minimum | 1 |
|---|---|
| Maximum | 509 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 361.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 9 |
| median | 9 |
| Q3 | 14 |
| 95-th percentile | 152 |
| Maximum | 509 |
| Range | 508 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 56.32166282 |
|---|---|
| Coefficient of variation (CV) | 2.005649275 |
| Kurtosis | 21.39263731 |
| Mean | 28.08151133 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 4.227358035 |
| Sum | 1143086 |
| Variance | 3172.129703 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 18693 | |
| 7 | 3065 | 6.6% |
| 14 | 2988 | 6.5% |
| 1 | 1907 | 4.1% |
| 6 | 1717 | 3.7% |
| 28 | 1556 | 3.4% |
| 8 | 848 | 1.8% |
| 3 | 541 | 1.2% |
| 37 | 513 | 1.1% |
| 83 | 507 | 1.1% |
| Other values (192) | 8371 | |
| (Missing) | 5522 | 11.9% |
| Value | Count | Frequency (%) |
| 1 | 1907 | 4.1% |
| 2 | 33 | 0.1% |
| 3 | 541 | 1.2% |
| 4 | 16 | < 0.1% |
| 6 | 1717 | 3.7% |
| 7 | 3065 | 6.6% |
| 8 | 848 | 1.8% |
| 9 | 18693 | |
| 10 | 186 | 0.4% |
| 11 | 260 | 0.6% |
| Value | Count | Frequency (%) |
| 509 | 8 | |
| 495 | 7 | |
| 484 | 10 | |
| 480 | 1 | < 0.1% |
| 476 | 1 | < 0.1% |
| 475 | 8 | |
| 474 | 17 | |
| 467 | 12 | |
| 464 | 1 | < 0.1% |
| 461 | 2 | < 0.1% |
| Distinct | 192 |
|---|---|
| Distinct (%) | 6.6% |
| Missing | 43323 |
| Missing (%) | 93.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 143.1958692 |
| Minimum | 8 |
|---|---|
| Maximum | 497 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 361.3 KiB |
Quantile statistics
| Minimum | 8 |
|---|---|
| 5-th percentile | 40 |
| Q1 | 40 |
| median | 91 |
| Q3 | 219 |
| 95-th percentile | 408 |
| Maximum | 497 |
| Range | 489 |
| Interquartile range (IQR) | 179 |
Descriptive statistics
| Standard deviation | 119.9337592 |
|---|---|
| Coefficient of variation (CV) | 0.8375504117 |
| Kurtosis | 0.19955487 |
| Mean | 143.1958692 |
| Median Absolute Deviation (MAD) | 51 |
| Skewness | 1.047480589 |
| Sum | 415984 |
| Variance | 14384.10659 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40 | 847 | 1.8% |
| 45 | 222 | 0.5% |
| 153 | 167 | 0.4% |
| 219 | 132 | 0.3% |
| 233 | 103 | 0.2% |
| 174 | 99 | 0.2% |
| 67 | 92 | 0.2% |
| 242 | 59 | 0.1% |
| 51 | 52 | 0.1% |
| 91 | 44 | 0.1% |
| Other values (182) | 1088 | 2.4% |
| (Missing) | 43323 |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 9 | 14 | < 0.1% |
| 11 | 1 | < 0.1% |
| 14 | 6 | < 0.1% |
| 18 | 1 | < 0.1% |
| 35 | 1 | < 0.1% |
| 38 | 37 | 0.1% |
| 40 | 847 | |
| 45 | 222 | 0.5% |
| 46 | 25 | 0.1% |
| Value | Count | Frequency (%) |
| 497 | 1 | < 0.1% |
| 494 | 1 | < 0.1% |
| 492 | 2 | < 0.1% |
| 491 | 1 | < 0.1% |
| 489 | 1 | < 0.1% |
| 486 | 1 | < 0.1% |
| 485 | 13 | |
| 483 | 2 | < 0.1% |
| 481 | 1 | < 0.1% |
| 479 | 1 | < 0.1% |
| Distinct | 74 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.150082201 |
| Minimum | 0 |
|---|---|
| Maximum | 379 |
| Zeros | 45127 |
| Zeros (%) | 97.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 361.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 379 |
| Range | 379 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 17.57645979 |
|---|---|
| Coefficient of variation (CV) | 8.174785028 |
| Kurtosis | 148.3738321 |
| Mean | 2.150082201 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 11.1532541 |
| Sum | 99394 |
| Variance | 308.9319387 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 45127 | |
| 58 | 164 | 0.4% |
| 87 | 76 | 0.2% |
| 63 | 51 | 0.1% |
| 38 | 47 | 0.1% |
| 176 | 39 | 0.1% |
| 77 | 37 | 0.1% |
| 223 | 36 | 0.1% |
| 48 | 33 | 0.1% |
| 98 | 30 | 0.1% |
| Other values (64) | 588 | 1.3% |
| Value | Count | Frequency (%) |
| 0 | 45127 | |
| 1 | 5 | < 0.1% |
| 2 | 2 | < 0.1% |
| 4 | 14 | < 0.1% |
| 5 | 2 | < 0.1% |
| 6 | 12 | < 0.1% |
| 7 | 1 | < 0.1% |
| 9 | 2 | < 0.1% |
| 10 | 1 | < 0.1% |
| 11 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 379 | 6 | < 0.1% |
| 330 | 14 | < 0.1% |
| 259 | 10 | < 0.1% |
| 236 | 29 | |
| 224 | 4 | < 0.1% |
| 223 | 36 | |
| 215 | 8 | < 0.1% |
| 207 | 10 | < 0.1% |
| 187 | 22 | |
| 178 | 25 |
customer_type
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 361.3 KiB |
| Transient | |
|---|---|
| Transient-Party | |
| Contract | 1195 |
| Group | 264 |
Length
| Max length | 15 |
|---|---|
| Median length | 9 |
| Mean length | 10.56889764 |
| Min length | 5 |
Characters and Unicode
| Total characters | 488579 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Transient |
|---|---|
| 2nd row | Transient-Party |
| 3rd row | Transient-Party |
| 4th row | Transient-Party |
| 5th row | Transient-Party |
Common Values
| Value | Count | Frequency (%) |
| Transient | 32306 | |
| Transient-Party | 12463 | 27.0% |
| Contract | 1195 | 2.6% |
| Group | 264 | 0.6% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| transient | 32306 | |
| transient-party | 12463 | 27.0% |
| contract | 1195 | 2.6% |
| group | 264 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 90733 | |
| t | 59622 | |
| r | 58691 | |
| a | 58427 | |
| T | 44769 | |
| s | 44769 | |
| i | 44769 | |
| e | 44769 | |
| y | 12463 | 2.6% |
| - | 12463 | 2.6% |
| Other values (7) | 17104 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 417425 | |
| Uppercase Letter | 58691 | 12.0% |
| Dash Punctuation | 12463 | 2.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 90733 | |
| t | 59622 | |
| r | 58691 | |
| a | 58427 | |
| s | 44769 | |
| i | 44769 | |
| e | 44769 | |
| y | 12463 | 3.0% |
| o | 1459 | 0.3% |
| c | 1195 | 0.3% |
| Other values (2) | 528 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 44769 | |
| P | 12463 | 21.2% |
| C | 1195 | 2.0% |
| G | 264 | 0.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12463 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 476116 | |
| Common | 12463 | 2.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 90733 | |
| t | 59622 | |
| r | 58691 | |
| a | 58427 | |
| T | 44769 | |
| s | 44769 | |
| i | 44769 | |
| e | 44769 | |
| y | 12463 | 2.6% |
| P | 12463 | 2.6% |
| Other values (6) | 4641 | 1.0% |
Common
| Value | Count | Frequency (%) |
| - | 12463 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 488579 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 90733 | |
| t | 59622 | |
| r | 58691 | |
| a | 58427 | |
| T | 44769 | |
| s | 44769 | |
| i | 44769 | |
| e | 44769 | |
| y | 12463 | 2.6% |
| - | 12463 | 2.6% |
| Other values (7) | 17104 | 3.5% |
required_car_parking_spaces
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 361.3 KiB |
| 0 | |
|---|---|
| 1 | 1921 |
| 2 | 3 |
| 3 | 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 46228 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 44302 | |
| 1 | 1921 | 4.2% |
| 2 | 3 | < 0.1% |
| 3 | 2 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 44302 | |
| 1 | 1921 | 4.2% |
| 2 | 3 | < 0.1% |
| 3 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 44302 | |
| 1 | 1921 | 4.2% |
| 2 | 3 | < 0.1% |
| 3 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 46228 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 44302 | |
| 1 | 1921 | 4.2% |
| 2 | 3 | < 0.1% |
| 3 | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 46228 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 44302 | |
| 1 | 1921 | 4.2% |
| 2 | 3 | < 0.1% |
| 3 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 46228 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 44302 | |
| 1 | 1921 | 4.2% |
| 2 | 3 | < 0.1% |
| 3 | 2 | < 0.1% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.7410876525 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 21617 |
| Zeros (%) | 46.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 361.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.8338522811 |
|---|---|
| Coefficient of variation (CV) | 1.125173626 |
| Kurtosis | 0.7991322246 |
| Mean | 0.7410876525 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.02196617 |
| Sum | 34259 |
| Variance | 0.6953096267 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 21617 | |
| 1 | 16699 | |
| 2 | 6403 | 13.9% |
| 3 | 1307 | 2.8% |
| 4 | 177 | 0.4% |
| 5 | 25 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 21617 | |
| 1 | 16699 | |
| 2 | 6403 | 13.9% |
| 3 | 1307 | 2.8% |
| 4 | 177 | 0.4% |
| 5 | 25 | 0.1% |
| Value | Count | Frequency (%) |
| 5 | 25 | 0.1% |
| 4 | 177 | 0.4% |
| 3 | 1307 | 2.8% |
| 2 | 6403 | 13.9% |
| 1 | 16699 | |
| 0 | 21617 |
stays_total
Real number (ℝ≥0)
| Distinct | 33 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.923617721 |
| Minimum | 0 |
|---|---|
| Maximum | 57 |
| Zeros | 308 |
| Zeros (%) | 0.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 361.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 57 |
| Range | 57 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.76218977 |
|---|---|
| Coefficient of variation (CV) | 0.6027428818 |
| Kurtosis | 55.05213447 |
| Mean | 2.923617721 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 3.406518854 |
| Sum | 135153 |
| Variance | 3.105312786 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 11895 | |
| 2 | 10992 | |
| 1 | 9169 | |
| 4 | 7704 | |
| 5 | 3221 | 7.0% |
| 7 | 1251 | 2.7% |
| 6 | 1116 | 2.4% |
| 0 | 308 | 0.7% |
| 8 | 209 | 0.5% |
| 9 | 120 | 0.3% |
| Other values (23) | 243 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 308 | 0.7% |
| 1 | 9169 | |
| 2 | 10992 | |
| 3 | 11895 | |
| 4 | 7704 | |
| 5 | 3221 | 7.0% |
| 6 | 1116 | 2.4% |
| 7 | 1251 | 2.7% |
| 8 | 209 | 0.5% |
| 9 | 120 | 0.3% |
| Value | Count | Frequency (%) |
| 57 | 1 | |
| 49 | 1 | |
| 48 | 1 | |
| 43 | 1 | |
| 34 | 1 | |
| 29 | 1 | |
| 28 | 1 | |
| 27 | 1 | |
| 24 | 1 | |
| 23 | 1 |
First rows
| df_index | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | adults | children | babies | meal | country | market_segment | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | assigned_room_type | booking_changes | deposit_type | agent | company | days_in_waiting_list | customer_type | required_car_parking_spaces | total_of_special_requests | stays_total | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 40060 | 6 | 2015 | 7 | 27 | 1 | 1 | 0.0 | 0 | HB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0.0 | No Deposit | 6.0 | nan | 0 | Transient | 0 | 0 | 2 |
| 1 | 40066 | 3 | 2015 | 7 | 27 | 2 | 1 | 0.0 | 0 | HB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 1.0 | No Deposit | 1.0 | nan | 0 | Transient-Party | 0 | 0 | 3 |
| 2 | 40070 | 43 | 2015 | 7 | 27 | 3 | 2 | 0.0 | 0 | HB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0.0 | No Deposit | 1.0 | nan | 0 | Transient-Party | 0 | 0 | 2 |
| 3 | 40071 | 43 | 2015 | 7 | 27 | 3 | 2 | 0.0 | 0 | HB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 1.0 | No Deposit | 1.0 | nan | 0 | Transient-Party | 0 | 0 | 2 |
| 4 | 40072 | 43 | 2015 | 7 | 27 | 3 | 2 | 0.0 | 0 | HB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0.0 | No Deposit | 1.0 | nan | 0 | Transient-Party | 0 | 0 | 2 |
| 5 | 40073 | 4 | 2015 | 7 | 27 | 3 | 1 | 0.0 | 0 | HB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0.0 | No Deposit | 1.0 | nan | 0 | Transient-Party | 0 | 0 | 2 |
| 6 | 40075 | 43 | 2015 | 7 | 27 | 3 | 1 | 0.0 | 0 | HB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 1.0 | No Deposit | 1.0 | nan | 0 | Transient-Party | 0 | 0 | 2 |
| 7 | 40077 | 43 | 2015 | 7 | 27 | 3 | 2 | 0.0 | 0 | HB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0.0 | No Deposit | 1.0 | nan | 0 | Transient-Party | 0 | 0 | 2 |
| 8 | 40078 | 43 | 2015 | 7 | 27 | 3 | 2 | 0.0 | 0 | HB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0.0 | No Deposit | 1.0 | nan | 0 | Transient-Party | 0 | 0 | 2 |
| 9 | 40082 | 43 | 2015 | 7 | 27 | 3 | 2 | 0.0 | 0 | HB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0.0 | No Deposit | 1.0 | nan | 0 | Transient-Party | 0 | 0 | 2 |
Last rows
| df_index | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | adults | children | babies | meal | country | market_segment | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | assigned_room_type | booking_changes | deposit_type | agent | company | days_in_waiting_list | customer_type | required_car_parking_spaces | total_of_special_requests | stays_total | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 46218 | 119380 | 44 | 2017 | 8 | 35 | 31 | 2 | 0.0 | 0 | SC | DEU | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0.0 | No Deposit | 9.0 | nan | 0 | Transient | 0 | 1 | 4 |
| 46219 | 119381 | 188 | 2017 | 8 | 35 | 31 | 2 | 0.0 | 0 | BB | DEU | Direct | Direct | 0 | 0 | 0 | A | A | 0.0 | No Deposit | 14.0 | nan | 0 | Transient | 0 | 0 | 5 |
| 46220 | 119382 | 135 | 2017 | 8 | 35 | 30 | 3 | 0.0 | 0 | BB | JPN | Online TA | TA/TO | 0 | 0 | 0 | G | G | 0.0 | No Deposit | 7.0 | nan | 0 | Transient | 0 | 0 | 6 |
| 46221 | 119383 | 164 | 2017 | 8 | 35 | 31 | 2 | 0.0 | 0 | BB | DEU | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0.0 | No Deposit | 42.0 | nan | 0 | Transient | 0 | 0 | 6 |
| 46222 | 119384 | 21 | 2017 | 8 | 35 | 30 | 2 | 0.0 | 0 | BB | BEL | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0.0 | No Deposit | 394.0 | nan | 0 | Transient | 0 | 2 | 7 |
| 46223 | 119385 | 23 | 2017 | 8 | 35 | 30 | 2 | 0.0 | 0 | BB | BEL | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0.0 | No Deposit | 394.0 | nan | 0 | Transient | 0 | 0 | 7 |
| 46224 | 119386 | 102 | 2017 | 8 | 35 | 31 | 3 | 0.0 | 0 | BB | FRA | Online TA | TA/TO | 0 | 0 | 0 | E | E | 0.0 | No Deposit | 9.0 | nan | 0 | Transient | 0 | 2 | 7 |
| 46225 | 119387 | 34 | 2017 | 8 | 35 | 31 | 2 | 0.0 | 0 | BB | DEU | Online TA | TA/TO | 0 | 0 | 0 | D | D | 0.0 | No Deposit | 9.0 | nan | 0 | Transient | 0 | 4 | 7 |
| 46226 | 119388 | 109 | 2017 | 8 | 35 | 31 | 2 | 0.0 | 0 | BB | GBR | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0.0 | No Deposit | 89.0 | nan | 0 | Transient | 0 | 0 | 7 |
| 46227 | 119389 | 205 | 2017 | 8 | 35 | 29 | 2 | 0.0 | 0 | HB | DEU | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0.0 | No Deposit | 9.0 | nan | 0 | Transient | 0 | 2 | 9 |